智能论文笔记

Attention-Aware Anime Line Drawing Colorization

Yu Cao , Hao Tian , P. Y. Mok

分类：计算机视觉 | 人工智能

2022-12-21

Automatic colorization of anime line drawing has attracted much attention in recent years since it can substantially benefit the animation industry. User-hint based methods are the mainstream approach for line drawing colorization, while reference-based methods offer a more intuitive approach. Nevertheless, although reference-based methods can improve feature aggregation of the reference image and the line drawing, the colorization results are not compelling in terms of color consistency or semantic correspondence. In this paper, we introduce an attention-based model for anime line drawing colorization, in which a channel-wise and spatial-wise Convolutional Attention module is used to improve the ability of the encoder for feature extraction and key area perception, and a Stop-Gradient Attention module with cross-attention and self-attention is used to tackle the cross-domain long-range dependency problem. Extensive experiments show that our method outperforms other SOTA methods, with more accurate line structure and semantic color information.

translated by 谷歌翻译

Graph Neural Networks for Low-Energy Event Classification & Reconstruction in IceCube

R. Abbasi , M. Ackermann , J. Adams , N. Aggarwal , J. A. Aguilar , M. Ahlers , M. Ahrens , J. M. Alameddine , A. A. Alves Jr. , N. M. Amin

分类：机器学习

2022-09-07

ICECUBE是一种用于检测1 GEV和1 PEV之间大气和天体中微子的光学传感器的立方公斤阵列，该阵列已部署1.45 km至2.45 km的南极的冰盖表面以下1.45 km至2.45 km。来自ICE探测器的事件的分类和重建在ICeCube数据分析中起着核心作用。重建和分类事件是一个挑战，这是由于探测器的几何形状，不均匀的散射和冰中光的吸收，并且低于100 GEV的光，每个事件产生的信号光子数量相对较少。为了应对这一挑战，可以将ICECUBE事件表示为点云图形，并将图形神经网络（GNN）作为分类和重建方法。 GNN能够将中微子事件与宇宙射线背景区分开，对不同的中微子事件类型进行分类，并重建沉积的能量，方向和相互作用顶点。基于仿真，我们提供了1-100 GEV能量范围的比较与当前ICECUBE分析中使用的当前最新最大似然技术，包括已知系统不确定性的影响。对于中微子事件分类，与当前的IceCube方法相比，GNN以固定的假阳性速率（FPR）提高了信号效率的18％。另外，GNN在固定信号效率下将FPR的降低超过8（低于半百分比）。对于能源，方向和相互作用顶点的重建，与当前最大似然技术相比，分辨率平均提高了13％-20％。当在GPU上运行时，GNN能够以几乎是2.7 kHz的中位数ICECUBE触发速率的速率处理ICECUBE事件，这打开了在在线搜索瞬态事件中使用低能量中微子的可能性。

translated by 谷歌翻译

Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion

Gongjie Zhang , Zhipeng Luo , Yingchen Yu , Jiaxing Huang , Kaiwen Cui , Shijian Lu , Eric P. Xing

分类：计算机视觉

2022-07-28

最近提出的检测变压器（DETR）已建立了一个完全端到端的范式以进行对象检测。但是，DETR遭受慢训练的融合，这阻碍了其对各种检测任务的适用性。我们观察到，由于对象查询和编码图像特征之间的语义不一致，DETR的缓慢收敛在很大程度上归因于将对象查询与相关区域匹配的困难。通过此观察，我们设计了与DETR ++（SAM-DETR ++）设计的语义对齐匹配，以加速DETR的收敛并改善检测性能。 SAM-DETR ++的核心是一个插件模块，该模块将对象查询和编码图像功能投射到相同的功能嵌入空间中，在该空间中，每个对象查询都可以轻松地与具有相似语义的相关区域匹配。此外，SAM-DETR ++搜索了多个代表性关键点，并利用其功能以具有增强的表示能力的语义对齐匹配。此外，SAM-DETR ++可以根据设计的语义对准匹配，以粗到5的方式有效地融合多尺度特征。广泛的实验表明，所提出的SAM-DETR ++实现了优越的收敛速度和竞争性检测准确性。此外，作为一种插件方法，SAM-DETR ++可以以更好的性能补充现有的DITR收敛解决方案，仅使用12个训练时代获得44.8％的AP和49.1％的AP，并使用Resnet-50上的CoCo Val2017上的50个训练时代获得50个训练时期。代码可在https://github.com/zhanggongjie/sam-detr上找到。

translated by 谷歌翻译

Learning Physics from the Machine: An Interpretable Boosted Decision Tree Analysis for the Majorana Demonstrator

I. J. Arnquist , F. T. Avignone III , A. S. Barabash , C. J. Barton , K. H. Bhimani , E. Blalock , B. Bos , M. Busch , M. Buuck , T. S. Caldwell

分类：机器学习

2022-07-21

Majorana示威者是一项领先的实验，寻找具有高纯净锗探测器（HPGE）的中性s中性双β衰变。机器学习提供了一种最大化这些检测器提供的信息量的新方法，但是与传统分析相比，数据驱动的性质使其不可解释。一项可解释性研究揭示了机器的决策逻辑，使我们能够从机器中学习以反馈传统分析。在这项工作中，我们介绍了Majorana演示者数据的第一个机器学习分析。这也是对任何锗探测器实验的第一个可解释的机器学习分析。训练了两个梯度增强的决策树模型，以从数据中学习，并进行了基于游戏理论的模型可解释性研究，以了解分类功率的起源。通过从数据中学习，该分析识别重建参数之间的相关性，以进一步增强背景拒绝性能。通过从机器中学习，该分析揭示了新的背景类别对相互利用的标准Majorana分析的重要性。该模型与下一代锗探测器实验（如传说）高度兼容，因为它可以同时在大量探测器上进行训练。

translated by 谷歌翻译

FewSOL: A Dataset for Few-Shot Object Learning in Robotic Environments

Jishnu Jaykumar P , Yu-Wei Chao , Yu Xiang

分类：计算机视觉 | 人工智能 | 机器学习 | 机器人

2022-07-06

我们介绍了几个弹出的对象学习（LITESOL）数据集，以供对象识别，每个对象有几个图像。我们从不同的视图中捕获了336个现实世界对象，每个对象有9个RGB-D图像。提供对象分割掩码，对象姿势和对象属性。此外，使用330 3D对象模型生成的合成图像用于增强数据集。我们研究了（i）使用我们的数据集的最先进的方法和最新方法，研究了（ii）（ii）使用最先进的方法和元学习的最先进方法的联合对象分割和几乎没有射击分类。评估结果表明，在机器人环境中，对于几个射击对象分类，仍有很大的边距可以改善。我们的数据集可用于研究一组几个弹出的对象识别问题，例如分类，检测和分割，形状重建，姿势估计，关键点对应关系和属性识别。该数据集和代码可在https://irvlutd.github.io/fewsol上找到。

translated by 谷歌翻译

Applications of Generative Adversarial Networks in Neuroimaging and Clinical Neuroscience

Rongguang Wang , Vishnu Bashyam , Zhijian Yang , Fanyang Yu , Vasiliki Tassopoulou , Lasya P. Sreepada , Sai Spandana Chintapalli , Dushyant Sahoo , Ioanna Skampardoni , Konstantina Nikita

分类：机器学习 | 计算机视觉

2022-06-14

生成的对抗网络（GAN）是在众多领域成功使用的一种强大的深度学习模型。它们属于一个称为生成方法的更广泛的家族，该家族通过从真实示例中学习样本分布来生成新数据。在临床背景下，与传统的生成方法相比，GAN在捕获空间复杂，非线性和潜在微妙的疾病作用方面表现出增强的能力。这篇综述评估了有关gan在各种神经系统疾病的成像研究中的应用的现有文献，包括阿尔茨海默氏病，脑肿瘤，脑老化和多发性硬化症。我们为每个应用程序提供了各种GAN方法的直观解释，并进一步讨论了在神经影像学中利用gans的主要挑战，开放问题以及有希望的未来方向。我们旨在通过强调如何利用gan来支持临床决策，并有助于更好地理解脑部疾病的结构和功能模式，从而弥合先进的深度学习方法和神经病学研究之间的差距。

translated by 谷歌翻译

Sample-Efficient Generation of Novel Photo-acid Generator Molecules using a Deep Generative Model

Samuel C. Hoffman , Vijil Chenthamarakshan , Dmitry Yu. Zubarev , Daniel P. Sanders , Payel Das

分类：机器学习

2021-12-02

光酸产生剂（PAG）是在暴露于光线时释放酸（$ H ^ + $离子）的化合物。这些化合物是用于制造半导体逻辑和存储芯片的光刻工艺的关键组分。半导体需求的指数增加突出了发现新型光酸发生器的需求。虽然De Novo分子设计使用深度生成模型被广泛用于药物发现和材料设计，但其在创建新颖的光酸发电机的应用构成了几个独特的挑战，例如缺乏房地产标签。在本文中，我们突出了这些挑战，并提出了一种生成的建模方法，该方法利用预先训练的深度自动化器和循环技术的条件生成。在主题专家的帮助下评估了拟议方法的有效性，表明在创建新型光酸生成器之外的应用方法的承诺。

translated by 谷歌翻译

Spatio-Temporal Scene-Graph Embedding for Autonomous Vehicle Collision Prediction

Arnav V. Malawade , Shih-Yuan Yu , Brandon Hsu , Deepan Muthirayan , Pramod P. Khargonekar , Mohammad A. Al Faruque

分类：计算机视觉

2021-11-11

在自动车辆（AVS）中，预警系统依赖于碰撞预测，以确保乘员安全。然而，使用深度卷积网络的最先进的方法在建模冲突中失败或者太昂贵/慢，使它们不太适合在AV边缘硬件上部署。为了解决这些限制，我们提出了SG2VEC，一种使用图形神经网络（GNN）和长短期内存（LSTM）层的时空场景图嵌入方法，以通过视觉场景感知来预测未来的碰撞。我们展示SG2VEC预测碰撞8.11％，比综合数据集的最新方法提前39.07％，在挑战现实世界碰撞数据集中更准确地进行29.47％。我们还表明，SG2VEC在将知识从合成数据集转移到现实世界驾驶数据集时更好。最后，我们证明SG2VEC更快地执行推论9.3X，较小的型号为88.0％，功率少32.4％，而且能量少于行业标准的NVIDIA驱动PX 2平台，制作它更适合在边缘实施。

translated by 谷歌翻译

Heterogeneous Graph Attention Network

Xiao Wang , Houye Ji , Chuan Shi , Bai Wang , Peng Cui , P. Yu , Yanfang Ye

分类：

2019-03-18

Graph neural network, as a powerful graph representation technique based on deep learning, has shown superior performance and attracted considerable research interest. However, it has not been fully considered in graph neural network for heterogeneous graph which contains different types of nodes and links. The heterogeneity and rich semantic information bring great challenges for designing a graph neural network for heterogeneous graph. Recently, one of the most exciting advancements in deep learning is the attention mechanism, whose great potential has been well demonstrated in various areas. In this paper, we first propose a novel heterogeneous graph neural network based on the hierarchical attention, including node-level and semantic-level attentions. Specifically, the node-level attention aims to learn the importance between a node and its metapath based neighbors, while the semantic-level attention is able to learn the importance of different meta-paths. With the learned importance from both node-level and semantic-level attention, the importance of node and meta-path can be fully considered. Then the proposed model can generate node embedding by aggregating features from meta-path based neighbors in a hierarchical manner. Extensive experimental results on three real-world heterogeneous graphs not only show the superior performance of our proposed model over the state-of-the-arts, but also demonstrate its potentially good interpretability for graph analysis.

translated by 谷歌翻译

Theoretically Principled Trade-off between Robustness and Accuracy

Hongyang Zhang , Yaodong Yu , Jiantao Jiao , Eric P. Xing , Laurent El Ghaoui , Michael I. Jordan

分类：

2019-01-24

We identify a trade-off between robustness and accuracy that serves as a guiding principle in the design of defenses against adversarial examples. Although this problem has been widely studied empirically, much remains unknown concerning the theory underlying this trade-off. In this work, we decompose the prediction error for adversarial examples (robust error) as the sum of the natural (classification) error and boundary error, and provide a differentiable upper bound using the theory of classification-calibrated loss, which is shown to be the tightest possible upper bound uniform over all probability distributions and measurable predictors. Inspired by our theoretical analysis, we also design a new defense method, TRADES, to trade adversarial robustness off against accuracy. Our proposed algorithm performs well experimentally in real-world datasets. The methodology is the foundation of our entry to the NeurIPS 2018 Adversarial Vision Challenge in which we won the 1st place out of ~2,000 submissions, surpassing the runner-up approach by 11.41% in terms of mean 2 perturbation distance.

translated by 谷歌翻译